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abstract: Activated factor V (Va) serves as an essential protein cofactor for the conversion of prothrombin 
to thrombin by factor Xa. Analysis of the factor V cDNA indicates that the protein contains several types 
of internal repeats with the following domain structure: A1-A2-B-A3-C1-C2. In this report we describe 
the isolation and characterization of genomic DNA coding for human factor V. The factor V gene contains 
25 exons which range in size from 72 to 2820 bp. The structure of the gene for factor V is similar to the 
previously characterized gene for factor VIII. Based on the aligned amino acid sequences of the two proteins, 
21 of the 24 intron-exon. boundaries in the factor V gene occur at the same location as in the factor VIII 
gene. In both genes, the junctions of the A1-A2 and A2-A3 domains are each encoded by a single exon. 
In contrast, the boundaries between domains A3-C1 and C1-C2 occur at intron-exon boundaries, which 
is consistent with evolution through domain duplication and exon shuffling. The connecting region or B 
domain of factor V is encoded by a single large exon of 2820 bp. The corresponding exon of the factor 
VII I gene contains 3 106 bp. The 5' and 3' ends of both of these exons encode sequences homologous to 
the carboxyl-terminal end of domain A2 and the ammo-terminal end of domain A3 in ceruloplasmin. There 
is otherwise no homology between the B domain exons. These data provide further insight into the evolutionary 
relationships within this family of related plasma proteins and provide a basis from which to begin the 
investigation of the cellular regulation of factor V biosynthesis and characterization of molecular defects 
in congenital factor V deficiency. 



he generation of thrombin by the prothrombinase complex 
is a critical event in hemostasis and thrombosis. This complex 
consists of factor Xa, factor Va, prothrombin, calcium, and 
a phospholipid or cellular surface. Factor Va participates as 
an essential protein cofactor in the activation of prothrombin 
by factor Xa (Kane & Davie, 1988). Most of the factor V 
in whole blood circulates in the plasma; however, 25% of the 
factor V in blood is stored in platelet a-granules (Tracy el al., 
1982). Platelet factor V is released during platelet activation 
and appears to be important for platelet surface prothrombin 
activation and normal hemostasis (Miletich el al., 1978). The 
major sites of factor V biosynthesis appear to include both the 
liver (Wilson et al., 1984) and megakaryocytes (Gewirtz et 
al., 1986). We (Kane & Davie, 1986; Kane et al., 1987) and 
others (Jenny et al., 1987) have previously determined the 
predicted amino acid sequence of human factor V by cDNA 
cloning. The mature protein consists of 2196 amino acids. 
Analysis of the factor V cDNA indicates that the protein 
contains several types of internal repeats organized with the 
following domain structure: A1-A2-B-A3-C1-C2. The A, 
B, and C domains each contain approximately 350, 836, and 
1 50 amino acids, respectively. The primary sequence of human 
factor V is 40% identical to human coagulation factor VIII 
except in the B domain where there is no homology (Toole et 
al., 1984; Vehar et al., 1984). The role of factor Villa in the 
activation of factor X is analogous lo the role of factor Va in 
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prothrombin activation. Thus, the factor X activation complex 
consists of factor IXa, factor Villa, factor X, calcium, and 
a phospholipid or cellular surface. The A domains present in 
factor V and factor VIII are approximately 30% identical to 
the triplicated A domains present in the plasma copper-binding 
protein ceruloplasmin (Ortel et al., 1*984). The C domains 
in factor V and factor VIII are also approximately 40% 
identical to the duplicated C domains present in a recently 
characterized murine breast epithelial cell protein (Slubbs et 
al., 1990). 

The amino acid sequence homology among this family of 
proteins suggests that the genes for these proteins have evolved 
through a process of gene duplication and exon shuffling 
(Gilbert, 1985). The gene for factor VIII has been charac- 
terized (Gitschier et al., 1984), and it spans > 180 kb on human 
chromosome Xq28. One striking feature of the factor VIII 
gene is that the entire B domain or connecting region is en- 
coded by a particularly large exon of 3106 bp. The gene for 
human factor V has been localized to human chromosome 
lq2l-25 (Wang et al., 1988). Recent gene mapping studies 
(Watson et aL, 1990) indicate that the gene for human factor 
V is located within a 300-kb region that also includes the genes 
for the selectin family of leukocyte adhesion molecules. We 
now report the cloning and characterization of the gene for 
human factor V. Our data provide evidence that gene du- 
plication and exon shuffling occurred during the evolution of 
this family of related plasma proteins and provide a basis from 
which to begin the investigation of the cellular regulation of 
factor V biosynthesis and characterization of molecular defects 
in congenital factor V deficiency (parahemophilia). 

Materials ano Methods 

Materials. All restriction enzymes and DNA-modifying 
enzymes were purchased from Bethesda Research Laboratories 
or United States Biochemicals. 32 P-Labeled nucleotides and 
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[or- 35 S]dATPaS were obtained from Amersham. The Se- 
quenase Version 2.0 DNA sequencing kit was obtained from 
United States Biochemicals. DNA amplification kits were 
obtained from Perkin-Elmer Cetus. 

Isolation of Genomic Clones for Factor V. A human lym- 
phocyte genomic library in \EMBL 3 (Clontech) and a human 
lung fibroblast genomic library in XFIX (Stratagene) were 
screened by the in situ plaque hybridization method of Benton 
and Davis (1977). A full-length factor V cDNA probe (Kane 
et al., 1990) was used to screen approximately 1 X 10 6 re- 
combinant phage from the XFIX library. Approximately 8 
X 10 5 phage from the XEMBL3 library were screened in a 
similar fashion using a restriction fragment (EcoRl-Accl) 
corresponding to nucleotides 1-269 of the factor V cDNA. 
The cDN A probes were labeled with [ 32 P]dCTP to a specific 
activity of > 10* cpm/pg in the presence of random hexanu- 
cleotidc primers and DNA polymerase I (Feinberg 8l Vo- 
gelstcin, 1983). Positive genomic clones were plaque purified 
using standard methods (Sambrook et al., 1989). 

Southern Blot Analysis of Genomic Clones and Human 
Genomic DNA. Phage DNA from the genomic clones and 
high molecular weight human genomic DNA from human 
peripheral blood leukocytes were isolated using standard 
methods (Sambrook et al., 1989). The DNA was digested with 
the restriction enzymes Sst\ and/or Xbal, separated on 0.7% 
agarose gels, and transferred to nitrocellulose membranes 
(Southern, 1975). The membranes were then hybridized with 
3 -P-labeled factor V cDN A probes or oligonucleotides in order 
to determine the location of exons and to verify that no re- 
arrangements occurred during the construction of the phage 
library. The filters were washed for 30 min at moderate (0.2X 
SSC, 1 65 °C) or high (0.2X SSC, 70 °C) stringency, and 
autoradiographs were exposed using XAR-5 film with inten- 
sifying screens (Du Pont). Restriction fragments isolated from 
Sstl or Xbal digests of the genomics clones were subcloned 
into pBluescript (Stratagene). These plasmid subclones were 
then mapped by restriction enzyme digestion and Southern 
blotting. These data were used to construct a genomic re- 
striction map for the enzymes Sstl and Xbal. The gene map 
was confirmed by characterization of at least two independent 
clones for the gene and by Southern blot analysis of human 
genomic DNA. 

DNA Sequence Analysis of the Isolated Genomic Clones. 
DNA sequences were determined by the dideoxy chain ter- 
mination method (Sanger et al., 1977) using either universal 
or synthetic oligonucleotide primers. Single-stranded templates 
were prepared using M13mpJ8/19 or pBluescript (Sambrook 
et al., 1989). Plasmids employed for double-stranded se- 
quencing were purified by cesium chloride banding (Sambrook 
et al., 1989). Approximately 95% of the DNA sequence for 
exons and exon-intron boundaries were sequenced on both 
strands. Sequences that deviated from previously published 
cDNA sequences were verified on independent templates on 
both strands. DNA sequences were stored and analyzed using 
a Macintosh cx computer and Mac Vector version 3.5 software 
(IBI). In most cases, exons were aligned to the genomic Xbal 
and Sstl map using restriction sites present in the exon or 
exon-intron boundary sequence. The locations of exons 15-18 
were determined by PCR (Saiki et al., 1988) using the fol- 
lowing pairs of oligonucleotide primers: 5'-ATACCT- 
ACGTATGGCATG-3' (exon 15) and 5'-G AGTCTCCAA- 
GAACTTCG-3' (exon 16); 5'-ATTATCAG A AG- 
AGCAAGG-3' (intron 16) and 5'-CA- 

1 Abbreviations: SSC, 0.01 M sodium cilrnie buffer. pH 7.4,0.15 M 
sodium chloride; PCR, polymerase chain reaction. 
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GATTGCCTTTTCCCTG-3' (intron 17); 5'-GGGAT- 
GATCTACAGCTTG-3' (exon 17) and 5'-TAGTAA- 
TAGGCACTCTCC-3' (intron 18). Typically, reactions 
contained 100 ng of a bacteriophage clone template, 1 M M each 
oligonucleotide, and 2.5 units of Taq\ polymerase. The DNA 
was amplified for 40 cycles of denaturation (1 min at 94 °C), 
annealing (2 min at 45-60 °C), and extension (2 min at 72 
°C) using a Coy thermal cycler. The magnesium chloride 
concentration was varied between 1 and 6 mM in order to 
optimize results. Reaction products were analyzed on \% 
agarose gels with ethidium bromide staining. 

Results and Discussion 

Isolation and Characterization of Factor V Genomic Clones. 
Forty-five genomic phage clones were isolated from the XFIX 
library using a 32 P-labclcd full-length factor V cDNA probe. 
These clones contained inserts ranging from 12 to 30 kb. 
Preliminary analysis indicated that 24 of these isolates rep- 
resented unique overlapping clones. Southern blotting and 
partial DNA sequence analysis indicated that the sequences 
5' to nucleotide 327 in the factor V cDNA were not present 
in these clones. Subsequently, five clones were isolated from 
the XEMBL3 library by hybridization with a probe containing 
nucleotides 1-269 of the factor V cDNA. Restriction and 
Southern analysis of these clones demonstrated that three were 
unique overlapping clones with inserts ranging from 12 to 16 
kb. Partial DNA sequence analysis of these clones indicated 
that they contained the sequences corresponding to nucleotides 
1-326 of the factor V cDNA. Hybridization experiments 
demonstrated no overlap between the two sets of overlapping 
clones (see below). 

Structure of the Human Factor V Gene. Twenty of the 
independent genomic clones were chosen for detailed char- 
acterization. Restriction fragments from the eight clones 
shown in Figure 1 were also subcloned into pBluescript and 
characterized as described under Materials and Methods. 
Characterization of Sstl and or Xbal digests of these clones 
or plasmid subclones by Southern blotting provided the data 
used to construct a restriction map of the factor V gene for 
these enzymes. It is possible that some small genomic frag- 
ments (<250 bp) containing intron sequences may not have 
been detected in our analysis. Restriction mapping of the 
genomic clones indicated that the gene was comprised of at 
least 17 Sstl fragments ranging in size from 0.7 to 12 kb 
(Figure 1). Fourteen of these fragments contained exons as 
judged by Southern blotting experiments. No genomic clones 
completely spanned the second intron. The sizes of the cloned 
genomic Sstl fragments were consistent with the size of the 
fragments observed when human genomic DNA was probed 
with factor V cDNA probes. This indicates that no major 
rearrangements occurred during the construction of the ge- 
nomic libraries. When genomic DNA was probed with factor 
V cDNA probes encoding the heavy chain (nt 1-2074) or the 
B domain (nt 2268-4785), only the predicted hybridizing 
fragments were observed consistent with a single copy of the 
factor V gene. However when genomic DNA was probed with 
probes encoding the light-chain region (nt 5001-6903, 
6000-6590, or 6591-6903), several additional hybridizing 
bands were observed after high stringency washes. 2 These 
additional bands may represent genomic DNA that is sequence 
related to the factor V gene. 

The structure of the human factor V gene is shown in Figure 
1. The human factor V gene spans >80 kb of DNA. The 

2 L. D. Cripc and w*. II. Kane, unpublished observations 
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figure 1: Organization of the human coagulation factor V gene. The size scale in kilobases (kb) is shown at the lop of the figure. The second 
line is schematic representation of the human factor V gene drawn to scale with exons indicated by the rectangles connected by lines representing 
the introns. The number for each exon is indicated. Genomic restriction sites for Sst\ and Xba\ arc depicted below. The order of the eight 
Xba\ fragments that are bracketed was not determined. The location for eight of the unique overlapping human genomic clones is indicated 
by the solid lines at the bottom of the figure. The identity of each clone is shown at the right or the bar, and the vertical lines indicate the 
location of genomic Sst\ sites contained in each clone. Clone XH VG63 was obtained from the XEMBL 3 library; the remainder of the clones 
were isolated from the XFIX library. There is no overlap between clones XHVG63 and XHVG20. This gap occurs in intron 2 of the factor 
V gene and it is indicated in the map by the interrupted lines. 



nucleotide sequences of exons and exon-intron boundaries were 
sequenced by the dideoxy chain termination method. The gene 
consists of 25 exons which encode the factor V mRNA as 
determined by comparison with the cloned cDNA (Kane & 
Davie, 1986; Kane et al M 1987; Jenny et al., 1987). The exons 
range in size from 72 to 2820 bp (Table I). The genomic 
clones include -5.4 kb of DNA 5' to exon 1 and -7 kb 3' 
to exon 25. The introns in the factor V gene range in size from 
0.4 to > 1 1 kb (Table II). There is no cross-hybridization 
between clones XHVG63 and XHVG20 indicating that these 
clones are missing a portion of intron 2. Splice donor and 
acceptor sequences agree with the GT-AG rule and conform 
to the consensus sequence of Mount (1982) except for intron 
6. Intron 6 agrees with the consensus sequence except that 
the donor-acceptor sequence is GC-AT. This variant donor 
splice site sequence accounts for the majority of nonconforming 
donor splice sites in nonimmunoglobulin genes (Shapiro & 
Senepathy, 1987). 

Comparison of Factor V Genomic and cDNA Sequences. 
The sequence of the factor V genomic clones is in excellent 
agreement with the two previously published cDNAs (Kane 
& Davie, 1986; Kane et al., 1987; Jenny et al., 1987). There 
are seven positions with differences between the genomic se- 
quence and the two published cDNAs (Table HI). In two 
cases the differences occur in domain A I at nucleotides 481 
and 628. These differences are silent in that they do not affect 
the predicted amino acid sequence; however, our previously 
reported cDNA (Kane et al., 1987) contains a Pst\ site at 
position 477 which is not present in the genomic clones or the 
cDNA sequence reported by Jenny (Jenny et al., 1987). The 
five remaining differences occur within the B domain of factor 
V. Four of these differences result in amino acid substitutions. 

The human factor V cDNAs isolated by Kane (Kane ct al., 
1987) and Jenny (Jenny et al., 1987) include 5' untranslated 
regions of 76 and 90 nucleotides, respectively. The first 20 
nucleotides of these sequences arc not identical. Comparison 
of the cDNA sequences to the genomic sequence helps to 
explain these differences. The sequence reported by Jenny 
(Jenny cl al., 1987) includes the sequence GAATTCCG which 
is not present in the gene (Figure 2). This sequence corrc- 



Tablc 1: Exon Organization of the Human Factor V Gene" 



exon 


cDNA 


length 


amino 
acids 


domain 


1 


14-234 


221 


53 


5' untranslated sequence, 










signal sequence, Al 


2 


235-326 


90 


30 


Ai 


3 


327-449 


124 


41 


Al 


4 


450-662 


212 


71 


Al 


5 


663-806 


144 


48 


Al 


6 


807-1028 


221 


74 


Al 


7 


1029-1194 


166 


56 


AI-A2 


8 


1195-1372 


178 


59 


A2 


9 


1373-1472 


99 


33 


A2 


10 


1473-1687 


215 


72 


A2 


1 1 


1688-1838 


151 


50 


A2 


12 


1839-2051 


213 


71 


A2 


13 


2052-4872 


2820 


940 


A2-A3, connecting region or 










B domain, 18 and 9 amino 










acid tandem repeats, 










thrombin cleavage sites 


14 


4873-5047 


175 


59 


A3 


15 


5048-5284 


237 


79 


A3 


16 


5285-5495 


21 I 


70 


A3 


17 


5496-5675 


180 


60 


A3 


18 


5676-5792 


117 


39 


A3 


19 


5793-5864 


72 


24 


Cl 


20 


5865-5968 


104 


35 


CI 


21 


5969-6124 


156 


52 


Cl 


22 


6125-6269 


145 


48 


Cl 


23 


6270-6421 


152 


51 


C2 


24 


6422-6604 


182 


61 


C2 


25 


6605-6890 


286 


48 


C2, 3' untranslated sequence 



a Numbering of nucleotide residues and protein domains corresponds 
to the previously published sequences (Kane & Davie, 1988). Amino 
acids with interrupted codons were assigned to the exon containing two 
of the three codon nucleotides. 

sponds to the EcoRl linker used to construct the cDNA library. 
The sequence of the Jenny cDNA following the linker agrees 
completely with the genomic sequence. The first 1 1 nucleotides 
of the cDNA sequence that we reported previously do not 
agree with the genomic sequence reported here. This dis- 
crepancy appears to be an artifact that occurred near adjacent 
Alu\ sites during the construction of the cDNA library. Thus, 
nucleotides 1-16 of our cDNA correspond to the comple- 
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» — 

5'-C TCCGGGCTGT CC CAGCT CGG CAAGCGCTGC-3 • Q 

5 • - GAATTCCG CAGCCCG GAGTGTGGTT AGCAGCTCGG CAAGCGCTGC-3' 

5 1 -TCTAGAGCTC ATTG CAGCT G GGACAGCCCG GAGTGTGGTT AG CAGCT CGG CAAGCGCTGC-3' - 
3 1 -AGATCTCGAG TAACGTCGAC CCTGTCGGGC CTCACACCAA TCGTCGAGCC GTTCGCGACG-5 ' ^ 



FIGURE 2: Comparison of the factor V genomic sequence with 5' untranslated regions of published cDN A sequences. The sequences corresponding 
to the cDNA reported by (a) Kane (Kane et al., 1987), (b) Jenny (Jenny et al., 1987), and (c) the genomic sequence reported here are listed. 
Sequences corresponding to the £coR\ linker are shown in bold print. Alu\ restriction sites arc underlined. The first 18 nucleotides of the 
cDNA reported by Kane (Kane et al., 1987) differ from the sequence reported by Jenny (Jenny et al., 1987) but are identical to complementary 
strand genomic sequence (arrows). 



Table II: Intron-Exon Splice Junctions of the Human Factor V Gene" 


Intron Exon Intron 


Exon 


Intron 


Intron 






size 


Type 






(kb) 





1 


TCAAG 


GTAACTCA. . . . 


TTATTTCCAG 


TTTGAATC 


3 


.8 


II 


2 


TTCAG 


GTAAGAAC .... 




GACTTCTT 


>10 


1 


3 


AGAAG 


GTAAGATA. . . . 


ACTCCTCCAG 


GTGCTTCT 


11 


I 


4 


AAAAG 


GTAAGAAC .... 


CCTTTCATAG 


GGACCCTA 


2 


.2 


I 


5 


GCCAG 


GTAACACG .... 


CTTTTCTCAG 


ATATAACA 


1 


.9 


I 


6 


GCAAG 


GCAAG AAA. . . . 


CCTAACTCAG 


CTGGGATG 


1 


.4 


I 


7 


GACAA 


GTGAGTTG. . . . 


TCACTTTAAG 


AAAATACA 


2 


. 4 


II 


8 


TCAAA 


GTAGTAAC .... 


ATTACTTGAG 


ATCGTGTT 


1 


.9 


0 


9 


CTCAG 


GTTTGAAT .... 




GCAGGAAC 


0 


. 6 


I 


10 


TACAG 


GTACTTTT .... 


.... TGTCCTCCAG 


AGGGCAGC 


3 


. 1 


0 


11 


GAGCA 


GTAAGTCA. . . . 


.... TCACTTTCAG 


CTATCAAT 


2 


.2 


I 


12 


TGTTG 


GTGAGTAA. . . . 




GAACTTGG 


1 


. 1 


I 


13 


CAAAG 


GTTTGGCC .... 


.... ATTATTTCAG 


GGAAACAG 


2 


.9 


II 


14 


TCCAA 


GTAAGTCT. . . . 


.... CTTTTCCTAG 


GTTCGTTT 


5 


.8 


0 


15 


ACCCA 


GTACTTCT 




GAAAAAGA 


1 


.2 


0 


16 


TCACG 


GTATTTTT 


. . . .TTGTC TTCAG 


CCATTAAT 


1 


.0 


I 


17 


GCCTG 


GTAAAGAT 




GTTCATTT 


3 


.0 


I 


18 


CAGAG 


GTATCACA 




ACTGTAGG 


1 


.3 


I 


19 


TCTGG 


GTAAGTTG 


.... CTTGAGTTAG 


GTTACTGG 


1 


.1 


I 


20 


TCCAG 


GTTTGTCT 




GTGGACAT 


0 


.4 


0 


21 


TGATG 


GTTTGTGT 




TATTTTAA 


2 


7 


0 


22 


AAATG 


GTAAGGTA 


.... CTTTTGATAG 


GATGTTCC 


2 


3 


I 


23 


CCAAG 


GTCAAGTA 


. . . .ATTTTCATAG 


GCAAACAA 


2 


8 


0 


24 


ACAAG 


GTAGAGTG 


.... TTTTGGTCAG 


ATTTTTGA 


1 


0 


0 



Consensus 

C A TTTTTT T 

AG GT AGT N AG 

A G CCCCCC C 



"Intron numbers are indicated in ihc first column. The nucleotide sequences of the exon-intron boundaries are shown for each intron. The splice 
donor and acceptor dinuclcotidcs are in boldface type and match the consensus GT...AG except for intron 6. The sizes of inlrons in the human factor 
V gene were delermined by restriction enzyme digestion or PCR. Type 0 indicates a splice between codons, type I and type II indicate a splice after 
the first or second nucleotide of a codon, respectively. The consensus sequence proposed by Mount (1982) is shown at the bottom of the table. 



Table HI: Differences in Nucleotide and Predicted Amino Acid 
Sequences among Factor V cDNA and Genomic Clones 0 









Kane et 




nucleotide 


genomic 


Jenny et al. 


al. cDNA 




no. 


codon 


cDNA codon 


codon 


amino acid 


481 


CGC 


GCG 


GCA 


Ala 


628 


TCC 


TCG 


TCT 


Ser 


2377 


TCA 


TCG 


TCG 


Scr 


2649 


AAA 


AGA 


AGA 


Lys to Arg 


2670 


CAT 


CCT 


CCT 


Mis to Arg 


2849 


AAG 


GAG 


GAG 


Lys to Glu 


3930 


CTT 


CTT 


ATT 


Leu to lie 



"The first column indicates the nucleotide number corresponding to 
the cDNA sequences reported by Kane (Kane & Davie, 1986; Kane et 
al., 1987). The second column indicates the sequence of the codon 
determined from the factor V genomic clones in the present work. The 
nucleotide that differs from previously reported nucleotides is shown in 
bold print. The second and third columns show the codons reported by 
Jenny (Jenny el al., 1987) and Kane (Kane & Davie, 1986; Kane el 
al., 1987). The fifth column indicates the amino acid encoded by the 
genomic sequence followed by the amino acid encoded by a cDNA se- 
quence if different. 



mentary strand of the genomic sequence terminating in an 
upstream Alu\ site (Figure 2). RNAse protection assays with 
a probe that extends 147 bases upstream from the ATG ini- 
tiation codon demonstrate two protected fragments of 97 and 
103 nucleotides. 3 Primer extension assays will be used to 
verify these as authentic transcription start sites. Thus, the 
first exon of the factor V gene encodes a 5' untranslated region 
of 97 or 103 nucleotides, a 28 amino acid leader peptide, and 
the first 24 amino acids of the mature protein. 

Exon-intron Organization of the Human Factor V Gene. 
Comparison with the Factor VI I J Gene and Correlation with 
Protein Domains. The exon-intron organization of the gene 
for human factor V is almost identical to the organization of 
the gene for human factor VIM (Gitschicr ct al., 1984). The 
genes for factors V and VIM arc composed of 25 and 26 exons, 
respectively. The larger mRNA of factor VIII relative to 
factor V is primarily due to the 1.8 kb of 3' untranslated 



L. D. Cripc and W. II. Kane, unpublished observations. 
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sequence in factor VIII. The gene for factor V spans >80 kb 
whereas the gene for factor VIII is considerably larger at 
-180 kb. This difference is largely due to the fact that six 
of the introns in the factor V gene are much smaller than the 
corresponding introns in the factor VIII gene. We have 
previously used the computer programs of Dayhoff to align 
the amino acid sequences for factor V and factor VIII (Kane 
& Davie, 1988). Figure 3 shows the aligned amino acid 
sequences near the exon-intron boundaries. Inspection of the 
corresponding genomic DNA sequences for factor V and factor 
VIII reveals that in 21 of 24 cases the exon-intron boundaries 
occur at precisely the same location in the aligned sequence. 
The 3' boundary of exon 5 of the factor V gene corresponds 
to the 3' boundary of exon 6 in the gene for factor VIII. This 
can be explained either by the gain or the loss of an inlron 
subsequent to the divergence of these two genes. The 3' 
exon-intron boundaries for exon 9 and 13 and the 5' exon- 
intron boundary for exon 14 arc not precisely conserved as 
judged by the protein alignment. These discrepancies may 
either be due to inaccuracies in the protein alignment or al- 
ternatively to the phenomenon of intron drift (Craik et al., 
1983). 

Although the complete structure of the gene for ceruloplasm 
has not been reported, a pseudogene for ccruloplasmin on 
chromosome 8 has been characterized as well as several clones 
for the functional ceruloplasmin gene on chromosome 3 
(Koschinskyetal., 1987; Royle ct al., 1987). Two exon-intron 
boundaries for the ceruloplasmin gene were reported. One 
boundary, corresponding to the 3' region of exon 9 in the factor 
V gene, does not occur at the same position as the boundary 
in either the factor V or factor VIII gene on the basis of the 
protein alignments. The second boundary corresponds to the 
5' end of exon 13 in the factor V gene and exon 14 in the factor 
VIII gene. This boundary occurs at precisely the same location 
in all three genes on the basis of the protein alignments. 

Figure 4 shows the location of introns and protein domains 
for factor V and factor VIM. Within the individual A domains 
of both proteins, the locations of some introns are conserved 
suggesting that they existed in the primordial gene for this 
domain whereas the others were either gained or lost subse- 
quent to the duplication events. The sequences encoding the 
A1-A2 and A2-A3 domain boundaries are each contained 
within a single exon in both genes. There is amino acid 
homology between factor V, factor VIII, and ceruloplasmin 
in the sequences encoded by the 5' and 3' ends of these exons; 
however, there are major differences in the amino acid se- 
quences encoded by the central portion of each exon (Ortel 
et al., 1984; Jenny et al., 1987; Kane et al., 1987). Thus, exon 
8 of the factor VIII gene contains a 33 amino acid insert that 
includes 15 acidic residues which is not present in exon 7 of 
the factor V gene. Futhermore, the exons encoding the domain 
A2-A3 boundaries in the genes for factor V and factor VIII 
are both extremely large and encode the entire B domain for 
the protein. Exon 13 in the factor V gene spans 2820 bp 
whereas the exon 14 in the factor VIII gene is 3106 bp in 
length. The exons encoding the B domain in factor V and 
factor VIII arc much larger in size than the reported average 
size for exons in vertebrate genes of 133 bp (Smith, 1988). 
Recently, however, large exons ranging from 1.5 to 8.7 kb have 
been described in a number of genes or gene families including 
apolipoprolein B (Ludwig cl al., 1987), the large subunit of 
RNA polymerase II (Ahearn et al., 1987), nuclear hormone 
receptor genes (Faber et al., 1989), collagen genes (Muragaki 
et al., 1991), and mucin genes (Liglenberg et al., 1990; To- 
ribara et al., 1991). 
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The B domain of factor V is not homologous to any known 
protein; however, it shares several features with the family of 
mucin glycoproteins (Ligtenberg et al., 1990; Toribara et al., 
1991; Gum et al., 1989; Porchet et al., 1991). These simi- 
larities may be an example of convergent evolution. Thus, the 
genes for mucins and factor V contain large exons which 
encode amino acid sequences that are rich in hydroxyl amino 
acids and proline. These proteins contain tandem repeats of 
short amino acid sequences and appear to be heavily glyco- 
sylated with both O- and N-linked oligosaccharide chains. In 
each of these proteins the amino acid sequences and tandem 
repeat structures appear to be unique. The genes for mucins 
have been shown to be polymorphic, containing a variable 
number of tandem repeats (Ligtenberg et al., 1990; Toribara 
etal., 1991 ; Gum etal., 1 989; Porchet et al., 1991). Thus far, 
this kind of polymorphism has not been observed in the gene 
for human factor V. 4 Factor V contains 3 1 repeats of a nine 
amino acid sequence. One of the cDNA clones that was 
isolated from a HepG2 cDNA library contained an in-frame 
deletion of precisely 1 1 of the tandem nine amino acid repeats 
(Kane & Davie, 1 986; Kane et al., 1 987). It will be of interest 
to determine whether this was an artifact of the cDNA library 
construction or a polymorphism in the human factor V gene. 
On the other hand, the predicted amino acid sequence for 
bovine factor V contains eight fewer nine amino acid tandem 
repeats and only a single copy of the 17 amino acid sequence 
that is duplicated in the human protein (Guinto et al., 1989). 
The B domain of factor VIII is also rich in carbohydrate and 
hydroxyl amino acids; however, it does not contain any tandem 
repeat structures (Toole et al., 1984; Vehar et al., 1984). 

In contrast to the sequences encoding the A1-A2 and A2- 
A3 domain boundaries, the sequences encoding the A3-C1 
and C1-C2 domain boundaries are not each contained within 
a single exon. In these cases the domain boundaries in the 
protein correlate precisely with type I exon-intron boundaries 
in the gene. Recently, Stubbs et al. (1990) cloned the cDNA 
for a murine breast epithelial cell protein of unknown function 
that consists of a single epidermal growth factor like domain 
followed by the two C-type domains. The C domains in the 
epithelial cell protein are approximately 40% identical to the 
C domains in factor V and factor VIII. The structure of the 
gene encoding this protein is not known. On the basis of the 
amino acid sequence homology, we predict that the exon-in- 
tron organization of this gene will be very similar to the genes 
for factor V and factor VIII. 

Evolutionary and Functional Considerations. The function 
of the large connecting region or B domains in factor V and 
factor VIII remains poorly understood. Both proteins circulate 
in plasma as relatively inactive cofactors. Activation of both 
molecules by thrombin yields an active species comprising a 
calcium-dependent helerodimer composed of heavy and light 
chains. The heavy chains contain domains A1-A2, whereas 
the light chains contain domains A3-C1-C2. The B domains 
contain all three of the thrombin cleavage sites in factor V and 
two of the three thrombin cleavage sites in factor VIII. 
Following thrombin activation, the B domains of both proteins 
are released as very large activation peptides comprising >40% 
of the mass of the original molecule. Recent ultrastructural 
data obtained using electron microscopy suggests that the B 
domains of factor V and factor VIII form a rodlike tail ex- 
tending from a globular domain containing the heavy chain 
and light chain (Moscsson et al., 1990a,b). The B domains 
of factor V and factor VIII arc not required for generation 



4 L. O. Cripcand W. 1 1. Kane, unpublished observations. 
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Factor V Gene Structure 
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figure 3: Comparison of exon-intron junction sequences of the genes for factors V and VIII. The amino acid sequences of human factor 
V and factor VIII were aligned by the method of Dayhoff as previously reported (Kane & Davie, 1988). The first line of each comparison 
is the genomic sequence for factor V determined in the present work. The second line is the corresponding translated amino acid sequence 
for factor V. The third line is the translated amino acid sequence for factor VIII which has been aligned to the factor V amino acid sequence. 
The fourth line in each series is the corresponding sequence for the factor Vllf gene determined by Gitschier (Gitschier et al., 1984). Genomic 
exon sequences arc shown as triplets, and the translated amino acid sequence for split codons is indicated by the split in the three-letter amino 
acid code. The number for each exon in the factor V and the factor VIII genes is indicated. Exon 5 in the factor V gene contains the sequences 
corresponding to exons 5 and 6 in the factor VIII gene. 
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of procoagulant activity since deletion of a large portion of 
these regions in recombinant proteins does not reduce the 
procoagulant activity of either molecule (Toole et al., 1986; 
Kane et al., 1990). However, these observations do not exclude 
other functions for the B domains. Thus, the factor V deletion 
mutant appeared to have increased constitutive procoagulant 
activity suggesting that the entire B domain is necessary to 
inhibit cofactor activity (Kane el al., 1990). Complete deletion 
of the B domain results in a molecule with markedly dimin- 
ished activity secondary to reduced rates of proteolytic acti- 
vation by thrombin (Pittman et al., 1990). Furthermore, the 
B domain of factor VIII serves a unique function in that it 
contains the binding site for von Willebrand factor (Pittman 



& Kaufman, 1989). The von Willebrand factor-factor VIII 
interaction is required for the stabilization of factor VIII in 
vitro as well as in vivo (Kaufman et al., 1988). 

Because of the large size of the exons encoding the B do- 
mains in factor V and factor VIII, we speculate that these 
exons arose through the insertion of a processed gene derived 
from reverse transcription of an mRNA. It is possible that 
the exons encoding the B domains of both factor V and factor 
VIII are derived from a single primordial cofactor gene. In 
this case, the lack of amino acid homology between the two 
cofactors would be due to divergent evolution. Analysis of the 
amino acid sequences for factor V and factor VIII from dif- 
ferent species suggests that there has been relatively little 
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selective pressure to conserve amino acid sequences in the B 
domains. Thus, the predicted amino acid sequence for the B 
domain of bovine factor V is only 59% identical to the cor- 
responding sequence of the human protein whereas there is 
88% amino acid sequence identity in the heavy chain and light 
chain regions (Guinto el al., 1989). Similar divergence has 
been observed between human and porcine factor VIII se- 
quences. Toole et al. (1986) reported -50% amino acid 
identity in the B domain region whereas there was 80-85% 
amino acid identity in the heavy-chain and light-chain regions. 
Divergence within the B domain of human factor V is further 
suggested by the fact that the five discrepancies between re- 
ported genomic and cDNA sequences that result in missense 
mutations all occur within the B domain. While the available 
data are consistent with a single insertion event in the pri- 
mordial cofactor gene followed by divergent evolution, it is 
also possible that two separate insertion events occurred 
subsequent to the divergence of the genes for factor V and 
factor VIII. An insertion event also appears to have occurred 
in the A1-A2 exon in the factor VIII gene which codes for 
an additional 33 amino acids that are not present in the 
corresponding region of factor V or ccruloplasmin. Charac- 
terization of related genes in primitive species may further 
clarify the evolution of this family of related genes. 

Characterization of the gene for human factor V provides 
the groundwork for examining natural mutations that affect 
factor V function and result in the rare bleeding disorder 
congenital factor V deficiency. Functional characterization 
of these defects should provide insight into the regulation of 
the prothrombinase complex. The major sites of synthesis of 
factor V appear to be the hepatocyte and megakaryocyte 
(Wilson et al., 1984; Gcwirtz et al M 1986). The acquired factor 
V deficiency associated with hepatic failure and the correlation 
of clinical severity with platelet factor V levels in parahemo- 
philia indicate the importance of factor V synthesis at these 
sites (Milctich et aL, 1978). Factor V has also been reported 
to be synthesized in other cell types including monocytes 
(Altieri & Edgington, 1989), endothelial cells (Shen & Edg- 
ington, 1991), T-cclls (Shen et al., 1990), and vascular smooth 
muscle cells (Rodgers, 1988). The importance of cellular 
surfaces in regulating blood coagulation suggests a possible 
role for factor V associated with these cells. Characterization 
of the regulatory elements of the factor V gene may help 
explain the tissue-specific expression of this protein arid the 
significance of cxtrahepatic synthesis in physiological and 
pathophysiological slates. 
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austract: Bovine liver glutamate dehydrogenase reacts with the bifunctional affinity label 5'-(/?-(fluo- 
rosulfonyl)bcnzoyl)-8-azidoadenosine (5'-FSBAzA) in a two-step process: a dark reaction yielding about 
0.5 mol of -SBAzA/mol of subunit by reaction through the fluorosulfonyl moiety, followed by photoactivation 
of the azido group whereby covalently bound -SBAzA becomes cross-linked to the enzyme [Dombrowski, 
K. E., & Colman, R. F. (1989) Arch. Biochem. Biophys. 275, 302-308]. We now report that the rate constant 
for the dark reaction is not reduced by ADP or GTP, but it is decreased 7-fold by 2 mM NADH and 40-fold 
by 2 mM NADH + 0.2 mM GTP, suggesting that 5'-FSBAzA reacts at the GTP-dependent NA.DH 
inhibitory site. The amino acid residues modified in each phase of the reaction have been identified. Modified 
enzyme was isolated after each reaction phase, carboxymethylated, and digested with trypsin, chymotrypsin, 
or thermolysin. The digests were fractionated by chromatography on a phenylboronate agarose column 
followed by HPLC. Gas-phase sequencing of the labeled peptides identified Tyr 190 as the major amino acid 
which reacts with the fluorosulfonyl group; Lys 143 was also modified but to a lesser extent. The predominant 
cross-link formed during photolysis is between modified Tyr 190 and the peptide Leu 475 -Asp 476 -Leu 477 -Arg 478 , 
which is located near the C-terminus of the enzyme. Thus, 5'-FSBAzA is effective in identifying critical 
residues distant in the linear sequence, but close within the regulatory nucleotide site of glutamate de- 
hydrogenase. 



^Bovine liver glutamate dehydrogenase [L-glutamate:NAD- 
(P) + oxidoreductase (deaminating), EC 1.4.1.3] is an allosteric 
enzyme that is activated by ADP, but inhibited by GTP and 
high concentrations of NADH. The enzyme in its smallest, 
active form is a hexamer of six identical subunits (Goldin & 
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Frieden, 1972; Julliard & Smith, 1979). For each subunit 
there are six binding sites for catalytic and regulatory purine 
nucleotides: one catalytic site that binds either NAD(H) or 
NAD(P)H (Goldin & Frieden, 1972), one additional co- 
enzyme site which is regulatory (Krause et al., 1974), two ADP 
sites (Batra & Colman, 1986a), and two GTP binding sites 
as measured in the presence of NADH (one of high affinity 
and one of low affinity) (Pal & Colman, 1979). 

Considerable information regarding the amino acid residues 
within the catalytic and regulatory sites of bovine liver glut- 
atmate dehydrogenase has come from affinity labeling ex- 
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